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ABSTRACT 



A hierarchical filing system provides a cataloging of 
data stored in various locations within a memory de- 
vice. An upside-down tree type structure provides a 
downwardly expanding cataloging structure wherein 
directories provide for further branchings. A branching 
from a directory is terminated when a file is reached. 
Each directory is assigned a unique directory identifier 
value. Then, each file or directory is coupled with the 
directory identifier value of its parent to provide the 
interconnection necessary to form the cataloging struc- 
ture. The complete cataloging structure is organized in 
the leaf nodes of a B-Tree structure and distributed in an 
ascending order of the key values to provide a system- 
atic search for a given key. Each file is capable of stor- 
ing a predetermined number of location description 
information when data is segmented into noncontigu- 
ous segments in memory. A file extents record is used to 
maintain record of the further segmentation. File loca- 
tion information is kept in the form of file extents de- 
scriptors in the leaf nodes of the separate FUe Extents 
B-Tree. This extents information is sorted, in an ascend- 
ing order based on a key comprised of a unique file 
number of a file relative starting block location of the 
file extent 

6 Chunks, 3 Drawing Sheets 
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which are used to identify and catalog various flies in 

HIERARCHICAL FILE SYSTEM TO PROVIDE the catatoging system, are also used to orguize the 

CATALOGING AND RETRIEVAL OF DATA catalog in the leai" nodes of the B-Tree. The keys are 

placed in an ascending order for systematic access. 

This is a continuation of application Ser. No. 924,802 S Further, the B-Tree grows by using left rotates and left 
filed Oct 30, 1986 now abandoned. splits with insertion of catalog information about new 

files from the right to maintain a balanced tree. 
BACKGROUND OF THE INVENTION y^^ 4 me < s ^ u j, store d, additions, deletions and 

1. Field of the Invention modifications will typically result in non-contiguous 

The present invention relates to the method of storing 10 physical storage of the data in the memory device. Each 
and retrieving data using a computer, and more specifi- of the contiguous segments of the file is known as a file 
cully to a hierarchical filing system. exteut A record of the physical location of the extents 

2 Prior Art for a particular flle is maintained in one or more extenu 

In a compute system, Information is typically stored records. The hierarchical filing system uses a file extents 
M ttgnaJs on various storage mediums, such as magnetic IS list to maintain the extents records of the various fUes on 
tapes, disks, semiconductor devices, etc. As storage the memory device. 

densities increased with advances in storage device The present invention maintains the first extents re- 

technology, it becajie possible for a device to store cord of a file in the cataloging structure, but any further 
much more information than previously. extents records are m ai n tai n ed in a separate flle extents 

When information is stored on a device, it is cata- 20 list This file extents list is also implemented in a second 
loged so that the same information is later retrieved B-Tree structure. 

when desired. Normally, a unique code name is attrib- BRIEF DESCRIPTION OF THE DRAWINGS 

uted to a particular body of data to differentiate it from o««-i/«v~r 

others. To retrieve a desired body of data, an appropri- FIG. 1 is a representation of a prior art flat filing 

ate code name associated with that data is used, wherein 25 system. 

the device searches for that code name and retrieves the FIG. 2 is a representation of a hierarchical filing 

desired data when that code name is found. system of the present invention. 

It is well-known in the prior art that each separate FIG. 3 is a representation of a B-Tree structure of the 

body of data is termed a file and the cataloging of these present invention. 

macfflac>rtcetoteni^Ming.Typk»Uy,cod«najne« 30 HO. 4hi representation of contents of a node for 
associated with particular data contain pointers which the B-Tree structure of FIG. 3, 
point to areas in memory reserved for mass storage. The FIG. S is a representation of a left-split anda Jeft- 

various code names and their pointers comprise the rotate operation of a B-Tree structure of the preferred 
cataloging system. When high-density storage devices embodiment 

are used, millions of bits of information are capable of 35 FIG. 6 is a representation of a cataloging structure of 
being stored on such a device, which permits hundreds, the preferred embodiment and an organixationof the 
thousands, and even millions of files to be created. To cataloging structure in various nodes of a B-Tree. 
search through these files in a serial fashion to look for FIG. 7 is a representation of a volume allocation 

a specific file is time-consuming. mapping in a filing system of the preferred embodiment. 

It is appreciated that what is needed is a filing system 40 FIG. 8 is a representation of a file extents list of the 
for a high-density storage medium which rapidly preferred embodiment and showing various flle extents 
searches and retrieves the desired file stored. Further, in memory. 

with the advent of the personal computer (PC) and the FIG. 9 is a representation showing the file extents 

small business computer, where physical sixe is a con- organization in the Catalog and Extenu B-Trees of the 
cern, it is desirable to have a filing system which may be 45 preferred embodiment 
implemented in a lesser line of program, yet be effec- DET AILED DESCRIPTION OF THE 

tual. PREFERRED EMBODIMENTS 

SUMMARY The present invention describes a method of storing 

A method for providing a Merarchical filing system is 50 and retrieving information using a hierarchical filing 
described. The hierarchical filing system provides a system. In the following description, numerous specific 
catalog of the data stored in various locations within a details are set forth in order to provide a thorough 
memory device. Typically, one cataloging structure is understanding of the present invention. It will be obvi- 
used to organize a volume of memory. ous, however, to one skilled in the art that the present 

The cataloging structure of the hiearchical filing 55 invention may be practiced without these specific de- 
system is provided by an upside-down tree type struc- tails. In other instances, well-known methods have not 
tire wherein there is a starting directory which oper- been described in detail in order not to unnecessarily 
ates as a root directory. Other directories and files ema- obscure the present invention, 
nate as off-spring. A plurality of descendant levels Referring to FIG. 1. a prior art flat filing system 10 is 

branch downward to provide the hierarchical structure 60 shown having a directory 11 and files 12-13. For ease of 
of the catalog. The cataloging structure contains the understanding, a directory is shown pictonally as a 
location information of where the actual data is stored. folder and a file is shown as a sheet of paper with a 

The fite cataloging system is implemented using a folded corner. The pictorial representation applies well 
B-Tree. The cataloging information is kept in the leaf to an analogy of placing papers into folders (files into 
nodes of the B-Tree. The non-leaf nodes (index nodes) 65 directories). In the prior art system 10, there is present 
of the B-Tree contain information that allows searching a single directory 11. which contains locator informa- 
for particular catalog information by using the code tion for files 12-13. Each of the files 12-13 contain data 
namVor key of the corresponding file. Key values, which would be associated with a specific body of 
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stored information. In this particular example of a prior 
art system 10, to acceu file 15, a serial search is made 
through directory 11, until the file address of file IS is 
located, such sequential search resulting in considerable 
lapse of time when substantial numbers of files exist in 
the directory 11. Although in this hypothetical exam- 
ple, directory 11 maintains pointer addresses to four 
files 12-15, directory 11 will continue to store addresses 
of subsequent files in a sequential fashion. 

FIO. 2 illustrates the architecture of the Hierarchical 
Filing System (HFS) of the present invention. This 
twirticuhu- HFS 16 include* a root directory 17 and files 
21-24. The HFS 16 also includes directories 18-20. 
Esch directory is capable of containing files, as well as 
other directories such as directory 18 containing direc- 
tory 20. Each directory 1* a branching node, allowing 
for none or a plurality of sub-branching nodes. Each 
directory contains information which permit* the 
branching to occur. The actual data is stored in the files 
21-24, Because each file is a termination node, it does 
not need to maintain further branching information. 
Instead, each file stores the actual data. Therefore, the 
directories 17-20 maintain branching information, 
while file* 21-24 contain the stored data. 



10 



IS 



20 



stored only in leaf nodes 35-38. The internal nodes 
32-34, also known as index nodes, contain pointer* to 
other nodes scch that these index nodes 32-34 provide 
an index for accessing the data records stored in the leaf 
nodes 35-38. Each record 39 includes a key 40 and an 
information segment 41. Within each node, the records 
are maintained so that their keys are in ascending order. 
The example B-Tree 31 of FIO. 3 contains hypothetical 
keys which have been inserted to show the structure of 
the tree, and the relationship between index nodes 
32-34 and leaf nodes 35-38. Leaf node 35 contains key 
values 4P and 50. The first key of a node is also repre- 
sented as a key in its ascending node. Therefore key 48, 
which is the first key of leaf node 35, is also represented 
as a key within index node 33. Key 53, which is the first 
key of leaf node 345, it represented as the second key of 
index node 33. Also, because key 48 la the first key 
within index node 33, it is again represented as a key 
within index node 31 This pattern is repeated for each 
leaf node 35-38 and each ascending index node 32-34 
for a B-Tree structure. Although FIO. 3 shews only 
three leveU and two keys per node, any number of keys 

per node, as well as any number of levels, may be 
chosen for a particular B-Tree structure. B-Tree 3 of 



SsS ZZXJ^E*** faahion 25 mi u drawn" iT .Typical example for illu.tr.. 



so that serial search for the files is not necessary. As- 
sume in the example of FIO. 2 that access to data stored 
in file 23 is desired. A search of directory 17 reveals that 
two possible paths exist in seeking the address of file 23. 
One path from directory 17 leads to directory 18 and 
the other path leads to directory 19. The desirable path 
is to directory 18, at whkh point there are again two 
paths. The desirable path from directory 18 leads di- 
rectly to file 23. Although this example is simplistic 
because of the miniscule number of files shown, one can 
appreciate the file search time saved when a substan- 
tially large number of files are present 

Further, as an example, if file 22 had been chosen, the 
path from directory 18 would have led to directory 20, 



30 



35 



tion purpose only. 
When a data record is needed, the key of the desired 

record is provided. The search begins at the root node, 
which is also an index node. A search is performed 
within the node until the record with the highest key 
that is not higher than the search key is reached. As- 
sume in the hypothetical example of FIO. 3, that data 
with key 59 is to be selected. The search commences at 
the root node 32, wherein key M is selected because the 
value 54 is the highest key that b not greater than the 
search key itself. The pointer of key 54 selects index 
node 34, wherein the search continues within index 
node 34 Again, key So" is chosen because it is the high- 
est key that is not greater than the search key itself (the 



path rrom directory u wouia nave ieo to oirecwry «». "=""'«" — T— Ii™rt» ~",,l^ ts. „„;„*«■ 
a? which point two ^. exi« from dir^ry 20. The « next ^J^JS^Z^SSSS^. «E 



desirable path to file 22 from directory 20 then would 
have been chosen. HFS 14, although shown in a partic- 
ular form in FIO. 2. may have any number of levels 
(branchings) down from the root directory 17 as well as 
any number of branches from a particular directory. 45 
However, it is to be noted that all data is stored in the 
represented files 21-24 which are all located at the 
termination nodes of HFS 16. 
In actuality, the cataloging architecture of the pre- 



of key 54 in index node 34 selects leaf node 37, Within 
leaf node 37, another search is made to identify search 
key 59. When search key 59 is found, its associated 
information (date) is used. 

A particular pointer in an index record leads to an- 
other node one level down in the B-Tree 31 For exam- 
ple, node 32 to node 34. The process continues until a 
leaf node is reached whereupon its records are exam- 
ined until the desired key is found. If the desired key is 



In actuality, the cataloging arcniieciure oi me pre- •»•»» •-"" «~ — — •- * — — - i.JL__ 



35 



thin the search key is reached or when all the record* in 
the leaf node have been examined. The key values may 
be numeric alphabetical or alphanumeric 

Referring to FIO. 4, it shows the structure of any of 
the nodes of a B-Tree of the present Invention. Each 
node 42 includes a node descriptor segment 43, records 
segment 44, record offset segment 44, and can have a 
free space segment 45. Each node 42 begins with a node 
descriptor segment 43. NDNRECS 58 contains the 



tion information in the HFS 14 structure. The catalog 
entries for file* 21-24 contain pointers which provide 
locator descriptions to locate places in storage area 
where actual stored date is kept 

B-TREE 

The HFS of the present invention is implemented 
using two B-Tree structure* in the preferred embodi- 
ment, the Catalog B-Tree and the File Extents B-Tree. ««»... K «« ~»—-»- — - ■•— -—--.«, xrrvrvPF u. 
A B-Tree structure is well-known in the prior art and is 60 number of records currently to the J^"°™"4 
described in The Art of Computer Programming Volume indicates the type of node. either leaf or index node. 
?^rt£ a and Searching)- by Donald E. Knuth; at NDHEIOHT 57 indicates the height of the node m the 
Seffi^lklnffiJLy Trees"; pp 471-479 tree, wherein leaf nodes are ^ as Jevel 1. and the 
m™ The nodes of a B-Tree contain records, wherein index nodes just above them are at level 2, etc. 
^^^l^Tc^into^on,*** 65 NDBLI1^52andr^FLr^lare^w«hB^ree 
Sters ordata, and a key associated with that record. nodes as a way of quickly -^fj*"^ Si'Sff 

Refer^toa toFIO 3. a hypothetical B-Tree is illus- of the various nodes at a given level. For each node. 

tra^edTbasic feW of XeB-Tree 31 is that data is NDBLINK 52 contains a pointer to the previous node. 
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and NDFLINK 51 contains a pointer to the subsequent 
node at the same level. In FIO. 3, NDBLINK for node 
36 would point to node 35 and NDFLINK for node 36 
would point to node 37. Therefore, NDBLINK 52 and 
NDFLINK 51 are means of locating adjacent nodes 5 
without first reversing back up the B-Tree. 

The records segment 44 contains the B-Tree's re- 
cords, each with its key and pointer or data information. 
In this particular example, there are two records 60 and 
61. The records in a node can be of variable length. For 10 
this reason, offsets to the beginning of each record are 
needed. The records segment begins immediately fol- 
lowing the node descriptor segment 43. The records are 
followed by a free space segment 45, which is basically 
the unused space of the node. Therefore, free space 13 
segment may not exist in tome instances. The record 
offset segment 46 at the end of the node contains the 
offset information for record! 60 and 61. Offset 68 con- 
tains offset information for record 60 and offset 67 con- _ -,,«,,-« 
tarns offset information for record 61. Offiet 66 contains 20 identifier known as the directory Idertifler (DirlD) 



data leftward and inserting new data at the right ex- 
tremities helps keep the B-tree balanced. Because the 
HF5 of the present invention is structured to have the 
ascending nodes organized in a rightward direction, the 
balancing is maintained even though the routes and 
splits are made toward the left direction. It is appreci- 
ated that right splits and route operations, or balanced 
insertions using both right and left operations can be 
used as well. Although the preferred embodiment uses 
and attempts to maintain a balanced B-Tree for search 
efficiency, most any B-Tree structure can be used, in- 
cluding unbalanced B-Tree. 

CATALOG TREE 

Referring to FIO. 6, a hypothetical catalog 90 is used 
to illustrate the Implemenution of cataloging of the 
preferred embodiment The structure 90 has a root di- 
rectory 91 named "Volume". Each directory of the 
preferred embodiment is assigned a unique numerical 



the offset necessary to determine free space 60. Thus the 
record segment 44 builds downward into the free space 
segment 45, while the record offset segment 46 builds 
upward into the free space segment 45 from the oppo- 
site end. 

If node 42 is an index node, then each record 60 and 
61 is comprised of a key and pointer information. Fur- 
ther. NDFLINK 51 and NDBLINK 52 would contain 
adjacent index node linking pointers. If node 42 is a leaf 



The root directory of catalog 90 has DirlD value of 2. 
Root directory 91 ha* three branches comprised of 
directory 92 and files 93 and 94. Directory 92 has a 
name of "Folder" and a DirlD value of 29. In turn, 
23 directory 92 has two branches comprised of files 95 and 
96. Files 93-96 are named "A", "B", "C" and "D", 
respectively in this example. The architecture of the 
directories and files follows the HFS structure as previ- 

___ m r . ously explained in FIO. 2. The complete cataloging 

nodeTthen each" record 60 and 61 is comprised of a key 30 structure 90 is stored as dau records in various leaf 
and dau information. NDFLINK 51 and NDBLINK nodes of the B-Tree of FIGS. 3 and 4 known as the 
52 would also contain leaf node linking pointers. It is catalog B-Tree. It is appreciated that the cataloging 
also appreciated that although a particular format is structure 90, although a tree, is in itself not a B-Tree. 
illustrated for node 42, the format may be modified The form ofstnicture 90 b actually stored in the various 
readily to include other types of information. Also, in 33 leaf nodes of a B-Tree. It is to be appreciated that the 
the preferred embodiment dau information in the leaf cataloging structure 90 not be confused with the previ- 
nodes of the HFS catalog B-Tree is used to address ous description of the B-Tree. Catalog 90 and the B- 
locations in memory where the actual daU is stored. Tree structure are two separate and distinct structures. 

Referring to FIG. 5, a specialized B-Tree expansion The hierarchical structure of the catalog 90 is imple- 
architecture as implemented in the preferred embodi- 40 mented as a B-Tree structure and stored as dau records 



ment is shown. A node 70, which is equivalent to node 
42 of FIG. 4, is shown having pointers to two lower- 
level nodes 71 and 73, which may be index or leaf 
nodes. Although only two nodes 71 and 73 are shown at 
the lower level, any number of nodes may reside at this 43 
lower level. Also in this particular hypothetical exam- 
ple, nodes 71 and 73 are only partially filled. 

For a B-tree to maintain Its balance, records must be 
kept uniformly spaced within the hierarchical structure. 
An unbalanced tree will result when records are not 30 
maintained uniformly in each node or nodes are heavily 
sucked toward one branch of the B-Tree. The pre- 
ferred embodiment uses a technique of left route and 
left splits to provide movement of records from one 
node to another to maintain a balanced Tree. When 33 
records are to be transferred to another node, the left 
route operation is used. In this instance, records in node 
73 are left routed to its left adjacent node 71, as shown 



in leaf nodes of a B-Tree similar to that of FIGS. 3 and 
4 

The hierarchical catalog structure 90 is stored in a 
storage device as shown by a memory map 97 of FIG. 
6. Cataloging map 97 is comprised of three possible 
types of records: directory records 100, file records 101, 
and thread records 10X Each record 100-102 is com- 
prised of a key 103 and information segment 104, as 
earlier described in the description of a leaf node of a 
B-Tree. The key 103 of each record is comprised of a 
value 10S and a name 106. .The key 103 of a directory 
record, such as that of 91 'and 92, is comprised of its 
directory name 106 and its parent directory's DirlD 
value 105. A information segment 104 of each directory 
record, such as that of directories 91 and 92 is com- 
prised of the directory's DirlD value 107. For directory 
92, the directory's DirlD has been given the value of 29, 
and has a name of "Folder". The parent DirlD of re- 
cord 92 has been given the value 2 because directory 92 



by arrow 77. -««»»• 

If another node is needed, such as when records in 60 is an offspring of directory 91 tn the structure 90. Direc- 
node 73 must be rotated and node 71 cannot accommo- tory record 91 has a directory DirlD value of 2, with a 
date records from node 73, a left split operation is used corresponding name of "Volume". Because directory 
to insert node 72 to the left of node 73, between nodes 91 is a root directory, the parent DirlD value has been 
71 and 73. In this Instance, node 72 is inserted to link given the value of 1, wherein the value 1 refers to the 
node 71 and node 73, as shown by arrows 78. When 65 foundation or the filing system itself, 
node 72 is inserted, appropriate pointer links will be A file record, such as file records 93-96, is also com- 

esublished with its index node 70 as well as adjacent prised of a key 113 and an information segment 114, 
link pointers for nodes 71 and 73. Continually moving wherein key 113 is also comprised of a parent DirlD 
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value and a name. However, in the information segment 
114, the descriptive location information for the actual 
stored file data is maintained as well as a unique file 
number. The information segments 114 of file records 
93-96 contain the descriptive location of the actual 
stored data information. 

File record 94, having a file name of B, and file record 
93, having a file name A. both have a parent DirlD 
value of 2. The parent DirlD value of 2 signifies that 



files A and B are direct offsprings of directory "Vol- 10 one indicated above. 



about the physical location of the data contained in the 
desired file. 

It will be appreciated that the specification of the file 
of the above example could start with the DirlD value 
of any directory on the path from the root to the desired 
file, and would then consist of this DirlD value and the 
sequence of names of the directories on the balance of 
the path from that directory to the desired file. The 
search mechanism followed is an obvious variant of the 



time" having a DirlD value of 2. File 95, having a name 
C, and file 96, .having a r.irr.e D, have parent DirlD 
values of 29, which reflect the origination of files C and 
D as offsprings of Htrectory 29 labeled "Folder", having 



ic mwvuH vw www » *•• 

Although cataloging structure 90 is a simplified struc- 
ture and FIG. 6 only shows the presence of a single 
structure having a single root directory 91, a cataloging 
structure may be enlarged manyfold. The preferred 



D as oJUpnngs oi nireciory <ty laocrea roiwa , i*™» structure may oe euuujjw luaujnun*. »«™ |.««.~ ■-- 
a DirlD value of 29. Therefore, by looking at any file or IS embodiment uses one HFS cataloging structure per 
. ji«», nn . ~v.ni 1 , v*v ina the stored information ..._»» A**Ar» «v»h h > disk. However, such a disk 



a directory record's key 103, the stored information 
provides the identification of the name of that particular 
record as well as the DtID value of the parent node. 
To provide the interconnection of the different 



memory device, such as a disk. However, such a disk 
can be partitioned and an HFS catalog assigned to each 
such partition. 
The catalog records of structure 97 of FIO. 6 are 



to provwe tnc inwrconnccuua ui mw ™™w» xne catalog recoro* oi wruc™™ »/ «• «-»>». w «v 

branches, a thread record 102 is provided for each di- 20 storw j M the dau records in the leaf nodes 42 of FIO. 4 
rectory. The key of a thread record contains a DirlD of t c^og B-Tree. These records are inserted and 
value and a null-name, which is equivalent to having no maintained in the catalog B-Tree in ascending alphanu- 
name at all. In the example of FIO. 6, thread record 10* mefic order -t^ if the leaf nodes of the B-Tree are 
provides the connection between the directory traversed from left to right, the data records will be 
"Folder" and files C and D. In the key 111 of thread 23 encountered m the order shown in structure 97 of FIO. 

record lot, only the directory DirlD value of "Folder" , «.< , i_».i_ »k- wma i« hmmUm ordei 

is given. In the information segment 112 of thread re- 
cord lOt. the DirlD of "Folder"^ parent and the direc- 



6. This order nH"**'"* the records in ascending order 
first by the DirlD value part of the key. Then, among 



records with the same DirlD value in their keys, the 



tory'sname "Folder" are given. Therefore, when file C, ^^ h ^phabetteal on the name part of the key 
having a parent DirlD 29 attempts to link to its ™medi- » , t j, ^ Appreciated that other pertinent intern 



ate parent directory 92. which has a DirlD of 29, the 
thread record 100 provides the name (Folder) of the 
parent directory 92. as well u the parent DirlD value of 

directory 92, which is equal to 2. 



It is also appreciated that other pertinent information 
may be stored in the various records besides what has 
been disclosed in FIO. C For example, directory and 
file records of the present invention maintain flags, date 



rectory 92, which is equal to 2. and time cfciwtkm of tlie directory or the file, as well 

Equivalent^ thread record 109 provides thename 3» „ t £ d \£ ^ tfan, of last modification. Abe, file re- 



as the date and time of last modification. Abo, file re- 
cords include such items as flags for locking the file, 
values to set logical and physical end of files, and size of 

the file 



FILE EXTENTS TREE 

As already noted, the catalog B-Tree's file record of 
a particular file contains informatioii about the locations 
in the memory device where the file's data is stored. 



(Volume) of directory 91 as well as its parent directory 
DirlD value for the three offsprings 92-94 of directory 
91. By having directory records 91-92, 'file records 
.93-96, along with thread records 100-109 for each di- 
rectory, the cataloging structure 90 is interconnected « 
Into a HFS, wherein the descriptive location informa- 
tion for the actual stored data is stored in file records 

91-92 as shown in the structure 97 of FIO. 6. 

By implementing the <***&* «*«ctu« 90 using £ --»— - j^- ^^ „,„, a \equentially 
B-Tree .Picture, the J^archicd f^SS^^Tr^ numb^dwllection of blocks. A series of contiguous 
structure 90 U easily stored in theleaf nodes or a B-Tree """\^ y^ h ^^ „ extent Ideally, a file would 
ofthe earlier description. For example, when file C is to ^^S?l!X^)!S»S^wmaoty 
be^xesKdbyacomputer.thesystmwiUimpletnenta ^^^^^^^l^^T^^ 

^^-SassSSSS - SSSESSsassass 

foUowed by "Folder", and finally "C. The search 
begins by finding the directory record in the Catalog 55 
B-Tree that corresponds to "Volume". Its name is 
"Volume" and since It is the root, its parent DirlD 
value is 1. The catalog B-Tree is searched for a direc- 
tory record with key <1> Volume; thus, directory 
record 91 is found. Its information segment then pro- 60 files data. ^^^ iBUM ri on « or«- 

"< gtiS$J& lESo Se t£?£«S. Tree, and records the volume Ration -*•*«- 
wh£e information segment provides this directory;, various "tents that ™^*L™»; $£%&£% 
DirlD value of 29. Thus now a »e«ch of the B-Tree is 65 any memory allocation system ^^^^"J* «* 
m«t.t« fi«i the d«t> record with key <29>C. This tents record ofthe present invention, a specific memory 
EZ££ £^^2Z*T*?£ riord 95. al.oc.don system is <^ *J££* «- ■» * 
whose information segment contains the information tents record of the preferred embodiment. 



\jmaK \ji y t v u w H w w i wsi ■ ■— ■— » -— — : 

particular file are usually stored in more than one ex- 
tent, separated into non-contiguous sections on a vol- 
ume. Each file extent can be identified by an extent 
descriptor. Thus, the complete location information of a 
particular file is a sequential extents list consisting of the 
extent descriptors of the various extents containing the 



Page 0010 of 0013 



Apple Computer Inc. Patent : 4_945_475 



4,945,475 



10 



Referring to FIG. 7, a memory volume 120 which is 
a portion of > memory device, such as a hard disk, is 
shown. Volume 120 is segmented into a number of logi- 
cal blocks 126. Typically, each logical block 126 is 
comprised of a predetermined fixed number of bytes, 
such as 3 12 bytes for the preferred embodiment. A fixed 
number of logical blocks starting at block and ending 
at block n is reserved for volume information. The 
balance of the memory device starting at block n+1 is 
available for data storage and this storage area is sepa- 
rated into allocation units, wherein each allocation unit 
is comprised of one or moie contiguous logical blocks. 

Volume 120 includes four areas 121-124 System 
itnrt-up area 121 contains certain configurable system 



10 



first on the file number field and then on the file relative 
position of the starting block. This allows efficient 
search through the B-Tree for the location information 
of data at a particular file relative position. 

In actuality, the preferred embodiment stores three 
extents descriptors, base plus two subsequent extents 
descriptors, the information data segment 114 of the 
file's catalog B-Tree record such as 9* of FIG. 6. There- 
fore, in the example of FIO. 8, extent descriptors 125a. 
126a and 127a are kept in the information segment of 
the cataloging structure and extents 128a-l3la are kept 
in the File Extents B-Tree as shown in FIG. 9. Permit- 
ting limited extent information to be kept in the data 
segments of a cataloging structure permits faster access 



parameters which are well-known in operating a disk or 15 t0 data. Only when a file contains four extents or more, 

other memory devices. Volume information area 122 ^ j t nee< j t0 consult the File Extents B-Tree. It should 

contains information regarding the housekeeping pa- ^ appreciated that the number of extents which are 

rameters of the volume, ;uch as number and size of each kcpt m (j, e (y e > s Catalog B-Tree record without using a 

allocation unit Volume bit map 123 main t ain s record of pjj e g^ents B-Tree is arbitrary and can be changed 

each allocation unit on the volume 120 and uses a bit 20 y^^t departing from the spirit and scope of the in- 

map to designate use or non-use of each allocation unit vention. 

Commencing at block n+1, a file content area 124 j^ referring t o FIG. 9, it shows a catalog file re- 
extends to the end of the Volume 120. File content area ^^ 145 ^ FUe £,^1^ B-Tree records 143 and 144. 
124 is separated into a number of allocation units, a, erased « the structure of B-Trees of the present 
wherein each allocation unit is comprised of a fixed 25 mvemtoni ^.j, record 143 and 144 is comprised of a 



number of logical blocks. While the bit map 123 main' 
tains volume space management, it does not provide file 
mapping. The file mapping function is provided by the 
file extents lists. 

Referring also to FIG. 8, a portion of file contents 30 
area 124 is shown containing information attributed to a 
file labeled file E. In this hypothetical example the en- 
tire contents of file E are separated into seven extents 
125-131. The first portion of the file is stored in base 
extent 125, the subsequent portions of the file are dis- 35 
tributed accordingly in extents 2-7 which are labelled 
126-131. File E has seven extents 125-131 which are 
not physically contiguous. To maintain file extents in- 
formation an extent descriptor 140 is used for the base 
extent 125 and each of the subsequent extents 126-131 40 
of file E. 

Extent descriptor 140 is comprised of a starting allo- 
cation unit number 141 and number of allocation units 
142. File E extents list 135, which is comprised of seven 



key 148 and 149 and extents list 146 and 147, respec- 
tively. To locate a certain portion of the data of a partic- 
ular file, first the Catalog B-Tree is searched for the 
corresponding file record. From this file record's infor- 
mation segment the file number is extracted. Also, the 
first three extent descriptors in the information segment 
of the catalog B-Tree file record are examined. If the 
required file data is contained within the corresponding 
extents, then the location information is. now readily 
available If however, the desired file date is located in 
extents beyond the three in the catalog's file record, 
then a search is made of the File Extents B-Tree using as 
a search key the file number and the computed file 
relative block position of the desired data. This search 
will lead to the file extent's B-Tree record containing 
the desired location information. 

The example using file E is comprised of 22 blocks 
and having an arbitrary file number equal to 20. The 



ss*s^3CKT?s=s5a=? « si.'srsssiisssssi 



to the address and length of each extent 125-131 of file 
E. For example, the fourth extent 128, which has a 
starting allocation address of 189 and is only two alloca- 
tion blocks long, has a value of 189 in field 141 and a 
value of 2 in field 142 of descriptor 128a. 

Extents descriptors of all files in a volume are main- 
tained in the present invention in the data records con- 
tained in the leaf nodes of B-Tree such as of FIGS. 3-5. 
This tree is known as the File Extents B-Tree and is a 
separate B-Tree from the earlier described catalog B- 
Tree. Each data record of this extents B-Tree consists of 
a key and an information segment as before in the dis- 
cussion of FIGS. 3-3. The information segment of a File 
Extents B-Tree data record is comprised of a sequence 
of extents descriptors of a particular file. The maximum 60 
number of extents descriptors in such a record can vary 
from implementation to implementation, but in the pre- 
ferred embodiment is set to three. The key of the File 



50 



55 



145 for file E provide the location information for the 
first 3 extents which in turn comprises the first 9 blocks 
(3+3+ 1) of the file. The location information for the 
remaining 13 blocks (2+3+1+7) of the file is con- 
tained in two data records 143 and 144 within the File 
Extents B-Tree. Assume that the desired data is at file 
relative block position 13 within file E. The extent de- 
scriptors contained in the file's catalog record are exam- 
ined first Since relative block 13 is greater than the 
number of blocks located by the extent descriptors in 
the file's catalog record, the File Extent B-Tree is 
searched. The key used for the B-Tree search for rela- 
tive block position 13 is <20,13>. 

Since the key value of "13" is greater than the value 
"9" of key 148 for the first Files Extents B-Tree record 
143 for file E and is less than the value "13" of key 149 
for the second record 144, the search results with a "not 
found" result but positions to the second B-Tree record 
144. By retrieving the previous record 143 of key 148, 



Extents B-Tree record consists of two fields: the file . „ _ , 

number of the particular file and the file relative posis- 63 the extent descriptor for relative block 13 is obtained. 

tic* of the starting block of the first extent descriptor in The value of "9" for key 148 is derived because extents 

that record. These extents records are kept in the leaf list 146 starts at the tenth retative block (allocation unit 

nodes of the Extents B-Tree sorted in ascending order number 9). The value of "15" for key 149 is derived 
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because extent* list 147 starts at the sixteenth relative 
block (allocation unit number 15). 

IMPLEMENTATION 

The HFS of the present invention is implemented in a 3 
computer which is coupled to a memory device, such as 
a disk, having an ability of storing millions of bits of 
information, although any storage medium can use the 
HFS. Typically, the HFS of tht present invention pro- 
vides the cataloging of various groupings of data, such >° 
as files, which are stored on the disk. 

Tbs preferred embodiment implements data storage 
by to: use of a cataloging structure previously de- 
scribed to cawlog dtU stored on a large capacity mem- 
ory device. It also m«i«t«in« a file extents record of up - 
to three extents per file in the catalog. Subsequent ex- 
tent information is stored in a separate file extents re- 
cord. Both the catalog record and the extents record are 
maintained using two B-Trees of the earlier described w 
B-Tree structure. 

The HFS as described in the preferred embodiment is 
controlled by a combination of hardware and software 
in a computer system. The HFS controlling routines are 
stored in a separate storage device than the device used 2] 
for storing the actual data. The preferred embodiment 
stores the routines in a read only memory (ROM), al- 
though most any storage medium may be used. 

Thus, a hierarchical filing system for use with a large 
capacity memory device in described. 30 

We claim: 

1. In a computer, a bierarchial filing system to pro- 
vide cataloging and retrieval of data stored on a storage 
device, said bierarchial filing system comprising: 
a memory for storing a program for said cataloging 3J 

and retrieval; 
a processor coupled to said memory and said storage 
device for processing an organizing means to cata- 
log and retrieve said data; said processor compris- 
ing; 40 
said program for organizing said data on said storage 
device into a hypothetical catalog which has a root 
directory, a plurality of branching directories ar- 
ranged at various s ub sequ en t levels from said root 
directory, wherein some of said branching directo- 45 
ries branch from other of said branch directories; 
said branching directories being interconnected 
such that for each of said branching directories 
there is only a singular path from itself to said root 
directory, and wherein some of said branching so 
directories have at least one file, each file corre- 
sponding to a represen tation of a predetermined 
portion of said stored data; 
an swigging means for assigning a unique identifica- 
tion value to said root directory and each of said js 
branching directories, and assigning an identifica- 
tion name to each of said files, root directory and 
branching directories, wherein each of said branch- 
ing directories and files are each provided with a 
key comprised of its identification name and its to 
next higher level directory identification value; 
s list forming means for forming a linear list of files 
and directory entries such that said file and direc- 
tory entries are ordered by said keys, such that said 
root directory being the highest level and files «$ 
heing the lowest level; and said interconnection or 
each of said singular path is provided by each file 
and branching directory identification name being 



associated with directory identification value of its 
next higher level; 
a structure fonning means for forming a B-Tree in- 
dexing structure having a beginning node, a plural- 
ity of indexing nodes and a plurality of terminating 
nodes, and wherein said linear list is stored in said 
terminating nodes of said B-Tree indexing struc- 
ture. 

2. The hierarchial filing system defined in claim 1, 
wherein said memory for storing said program is a read 
only memory. 

3. In a computer system where data is to be cata- 
logued when stored into a memory device, a method 
performed by the computer system for providing a 
hierarcliial filing system to catalogue said date into a 
volume of said memory device for subsequent retrieval, 
comprising the steps of: 

creating a root directory, a plurality of subdirectories 
and a plurality of files; 

organizing said root directory, subdirectories and 
files into a hypothetical catalog wherein said root 
directory is at a topmost level and said subdirecto- 
ries are arranged at various s u bseque n t levels from 
said root directory, some of said subdirectories 
branch from other of said subdirectories, but said 
subdirectories being interconnected such that for 
each of said subdirectories there is only a singular 
path from itself to said root directory, and wherein 
each of said files being interconnected to branch 
(root a certain one of said subdirectories only, such 
that for each file there is only a singular path from 
itself to said root directory; 

fMigwiwg « unique numerical directory identification 
value to said root directory and to.each of said 
subdirectories m said volume; 

assigning an identification name to said root directory 
and to each of said subdirectories and files, such 
that no two subdirectories branching from a root 
directory has a same name, no two subdirectories 
branching from another subdirectory has a same 
Mm* , and no two files branching from one of said 
directories has a same name; 

wherem each of said subdirectories and files are each 
provided with a key comprised of its identification 
name and its next higher level directory identifica- 
tion value; 

forming a linear list of files and subdirectory entries 
such that said file and subdirectory entries are or- 
dered by said keys, such that said root directory 
being the highest level and files being the lowest 
level; and said interconnection of each of said sin- 
gular path is provided by each file and subdirectory 
identification name being associated with directory 
identification value of its next higher level; 

forming a B-Tree indexing structure having a begin- 
ning node, a plurality of indexing nodes, and a 
plurality of terminal nodes; 

storing said linear list in said terminal nodes of said 
B-Tree structure in alphanumerical order accord- 
ing to said numerical directory value; 

assigning said identification name of a given file to a 
respective portion of said data; 

storing said data; 

placing memory location information in said files, 
wherein for each given file its memory location 
information locates its respective portion of said 
data stored in said memory device. 
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4. The method as described in cUim 3 wherein said 
step of forming said B-Tree indexing structure further 
comprises the step of forming a B-Tree structure 
wherein said beginning node comprises a root node of 
said B-Tree, said indexing nodes comprise branch nodes 
of said Be-Tree, and said terminal nodes comprise leave 
nodes of said B-Tree. 

5. The method as described in claim 4 wherein said 
step of placing location information in said flies com- 
prises the step of providing a plurality of extent point- 
ers, each extent pointer pointing to a location of a por- 
tion of said data stortu in s»id memory device corre- 
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sponding to said given file such that non-contiguous 
data segments are made to correspond to each said file. 
6. The method as described in claim 5 further com- 
prising the step of forming a second B-Tree structure to 
store a linear list of additional extent pointers for those 
files which have more extent pointers than that which 
can be stored to each file, said linear list of additional 
extent pointers being stored in terminal nodes of said 
second B-Tree structure by having each additional ex- 
tent pointer stored in one of said terminal nodes. 
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